Automatic Transcription of Meetings Using Topic-oriented Language Model Adaptation
نویسندگان
چکیده
This paper presents an automatic speech recognition (ASR) system dedicated for meetings of the National Congress of Japan. The distinctive features of the congressional meeting speech are wide distribution and frequent change of topics. For more accurate transcription, such topics should be emphasized in a language model one after another. Therefore, we introduce two approaches for topic adaptation: PLSA-based approach and trigger-based approach. The PLSA-based adaptation is performed turn by turn to emphasize topics in individual pair of a question and answer. Since topics are treated in a probabilistic manner, robust adaptation is realized. On the other hand, the trigger-based adaptation stresses relevant words to the word history, thus long-distance context can be reflected into a language model. These two approaches were evaluated on real meetings of the Congress, and significant improvement of perplexity was obtained by both approaches. We also compared their effects on reduction of word error rates.
منابع مشابه
Trigger-based language model adaptation for automatic meeting transcription
We present a novel trigger-based language model adaptation method oriented to the transcription of meetings. In meetings, the topic is focused and consistent throughout the whole session, therefore keywords can be correlated over long distances. The trigger-based language model is designed to capture such long-distance dependencies, but it is typically constructed from a large corpus, which is ...
متن کاملLanguage Model Adaptation Based on PLSA of Topics and Speakers for Automatic Transcription of Panel Discussions
Appropriate language modeling is one of the major issues for automatic transcription of spontaneous speech. We propose an adaptation method for statistical language models based on both topic and speaker characteristics. This approach is applied for automatic transcription of meetings and panel discussions, in which multiple participants speak on a given topic in their own speaking style. A bas...
متن کاملUnsupervised language model adaptation based on topic and role information in multiparty meetings
We continue our previous work on the modeling of topic and role information from multiparty meetings using a hierarchical Dirichlet process (HDP), in the context of language model adaptation. In this paper we focus on three problems: 1) an empirical analysis of the HDP as a nonparametric topic model; 2) the mismatch problem of vocabularies of the baseline n-gram model and the HDP; and 3) an aut...
متن کاملImplicitly Supervised Language Model Adaptation for Meeting Transcription
We describe the use of meeting metadata, acquired using a computerized meeting organization and note-taking system, to improve automatic transcription of meetings. By applying a two-step language model adaptation process based on notes and agenda items, we were able to reduce perplexity by 9% and word error rate by 4% relative on a set of ten meetings recorded in-house. This approach can be use...
متن کاملLanguage model adaptation based on PLSA of topics and speakers
We address an adaptation method of statistical language models to topics and speaker characteristics for automatic transcription of meetings and discussions. A baseline language model is a mixture of two models, which are trained with different corpora covering various topics and speakers, respectively. Then, probabilistic latent semantic analysis (PLSA) is performed on the same respective corp...
متن کامل